A RIGOROUS PROOF OF THE CAVITY METHOD FOR COUNTING MATCHINGS 

MOHSEN BAYATJi AND CHANDRA NAIR^ 



Abstract. In this paper we rigorously prove the vaUdity of the cavity method for the problem of counting 
the number of matchings in graphs with large girth. Cavity method is an important heuristic developed 
by statistical physicists that has lead to the development of faster distributed algorithms for problems in 
various combinatorial optimization problems. The validity of the approach has been supported mostly by 
numerical simulations. In this paper we prove the validity of cavity method for the problem of counting 
matchings using rigorous techniques. We hope that these rigorous approaches will finally help us establish 
the validity of the cavity method in general. 



1. INTRODUCTION 

1.1. Motivation. Distributed message passing algoritlims lilce belief propagation have been around for 
over a decade now |14[ I2UI I18L ^ . Recently some important problems in combinatorial optimization have 
seen faster distributed algorithms - motivated using a heuristic technique in statistical physics called the 
cavity method - that seem to solve problem instances much larger than what was previously feasible. This 
method has also led to analytical predictions about some threshold phenomenon in various problems of 
cross-disciplinary interest. Some examples of its application include the satisfiability threshold for random 
constraint satisfaction |lll I12| and the corresponding survey-propagation algorithm , iterative decoding 
algorithms in multi-user CDMA [H], etc. 

However, very few rigorous results arc known concerning the validity of the cavity method and the 
convergence of the algorithms. In this paper we wish to add to the body of rigorous results [3 El ITB) 
supporting the predictions of the cavity method by showing its correctness for the problem of counting 
matchings in large sparse graphs. We borrow some of the techniques from Gamarnik et.al. [HI but we hope 
that some newer lines of the argument (e.g. showing validity of the free energy shifts) could lead to useful 
insights into the validity of the method for other instances in which the method was applied. 

The algorithms generated using this method in some instances resemble the naive believe propagation 
equations, whereas in some other instances they resemble two-layered belief propagation (or survey propaga- 
tion) equations, and in few other cases are significantly more involved. In the problem of counting matchings, 
the equations generated using the cavity method resemble the naive belief propagation equations. In this 
paper we show the convergence of the cavity equations and the uniqueness of the fixed points for arbitrary 
graphs G. In general such convergence results arc not known except for trees or graphs with exactly one 
cycle UH. 

1.2. The matching problem. Counting the number and size of matchings on various types of random 
graphs has been a classical problem in graph theory. This problem has been intensively studied for a long 
time by mathematicians and computer scientists |13| . Very recently Zdebrova and Mezard |21| used the 
cavity method to solve this problem. They believed that the results obtained by this heuristic are exact for 
the matching problem and using this approach: a) they derived an algorithm that computes the entropy for 
arbitrary graphs with girth that diverges in the large size limit and b) derived analytical results for regular 
and Erdos-Rcnyi random graph ensembles. 

We first define the problem of finding the number of perfect matchings in a simple graph and then describe 
the cavity equations for solving it. 
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1.2.1. Problem setup and notation. Consider a graph G = {V, E) with n vertices V, and edge-set E. Through- 
out this paper we will always assume that G is simple (i.e. G has no multi-edge or self-loops) and undirected. 
The girth of a graph G is defined as the length of the shortest cycle. A matching is a subset of edges M C E 
such that no two edges of M have a common endpoint. Let \M\ denote the size of matching M and let M* 
be a matching of maximum size. If \M*\ = n/2 then M* is called a perfect matching. 

Counting the number of perfect matchings in a graph G is shown to be #-P complete |17| . (i.e. in general 
no polynomial-time algorithm can find the exact number of perfect matchings of G unless P ~ NP). Since it 
is widely believed that P ^ NP, many approaches have been focused on finding polynomial-time algorithms 
for approximately counting the number of perfect matchings [7| |H1 13 ■ 

Let a,b, . . . denote the vertices of G and i,j,--- denote the edges. For every vertex a G G let N{a) denote 
the vertex-neighborhood of vertex a, i.e. N{a) = {b : (a, b) S E}, and let E{a) denote the edge-neighborhood 
of vertex a, i.e. the set of edges in E that have the vertex a as an endpoint. 

Describe a matching M by variables Si = Safi G {0, 1} assigned to each edge i = {a,b) ^ E with 



Si 



1 edge i £ M 
edge i ^ M 

Since M is a matching, it follows that for any vertex a V: 

For every matching M C E define its energy to be its number of unmatched vertices: 

Eg{M) = Y,Ea{M)=n-2\M\ 

where Ea{M) = 1 - E&GAr(a) 

This induces a probability distribution (called the Gibb's distribution) on the set of all matchings, A4{G), 
of the graph G, defined by: 



where /3 is a positive number and is called inverse temperature. The normalizing term Zq{P) = X]a/ 



^-PEg^M) 

is called partition Junction. 

The partition function of an empty graph is defined as 1 . To simplify notation we will omit the dependence 
on (3 and write Pq instead of Pg,i3 a-nd Zg instead of Zg{(3). 

Let a configuration denote a collection of edges S C E. Observe that Pg can also be represented on the 
set of all configurations, as below. Let for all a € ^ 

Then it follows that 

Pg{s) = ^11 MS). 

Note that Eg{M) is minimized when M is a maximum size matching and for large values of /3 the partition 
function is dominated by the terms corresponding to maximum size matchings of G. Hence for /3 3> 

^^G^-^iogz,«:iMM + ^,(,.r) 

p p 

where Ng is number of maximum size matchings in G. Fg{P) is defined as the free-energy of the system. 
Observe that for large /3, we have 

Eg{M*) « ^(/?^g) 
log(7VG) « p(^{PFg)-Fg 



with the approximation becoming exact as /? — > oo. 

The cavity method of statistical physics is a heuristic that is used to evaluate the partition function. In 
the next section wc will state the equations derived in |21| using the cavity approach and then prove that 
these equations compute the partition function exactly for sparse graphs. 

Remark 1.1. In the first few sections the term sparse graph is used loosely to mean graphs that have no short 
loops. The precise dependence needed between the length of the shortest loop and the size of the graphs 
(measured in terms of the number of vertices and number of edges) will be spelt out in the final section. 

1.3. The cavity-claims for the matching problem. Let h^^°- : E x V M. he the 'message' that edge 
i = (a, 6) conveys to vertex a, one of its end-points. Note that there are 2\E\ messages as there are two 
messages for each edge. 

The following two claims form the algorithmic and analytical crux of the cavity method for this problem. 
Claim 1.2 (Zdeborova-Mezard). Consider the iterative equation defined by 



1 



l) = -^log 



(1.1) 



jeE{b)\i 

These iterative equations converge to a unique fixed point for a large sparse graph whose girth diverges 
with the size of the graph. 

Let /i*"*" be the unique fixed points of the system of equations in the above claim, i.e. 
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log 



jeE{b)\i 



(1.2) 



Claim 1.3 (Zdeborova-Mezard). The free energy for a single large sparse graph is given by 



where 



E - 



A_Fa is often called the free energy shift corresponding to the removal of a vertex a and its associated 
edges, and Ai^; the free energy shift corresponding to the removal of the edge i. 

Remark 1.4. Our outline of the proof is as follows: first we will prove the cavity-claims for the case when 
G is a tree, and then we will proceed to establish this for a large sparse graph. Validity of Claim 11.21 is 
well-known for the case of a tree and the main result in the next section is to show that Claim [T31 is exact 
as well. 



2. The validity of the cavity-claims on a tree 

In this section, we shall assume that the graph G is a tree. Note that removing any edge i — (a, b) from 
G splits the graph into two subgraphs: G*'" containing the vertex a, and G''^ containing the vertex b. 

Remark 2.1. For any graph G = (V, E) with a & V and i G E, let Gi denote the graph with edge i removed. 
Further, let Ga denote the graph G with vertex a and with all edges adjacent to a removed. In Figure ITTl 
observe that GJj" is formed from G*'" by removing vertex a and all its adjacent edges. 



Observe that, 



Defining 



Zg^ 
Zg 



Figure 2.1. The graph G and the various sub-graphs 



we see that 



Note that 



Thus, 



Zg 

" ie_E(a) 

Lemma 2.2. Free energy Fq, can he expressed as the sum of free energy shifts when G is a tree, 

a i 

Proof. Observe that 

asy ieE aeV ieE ^ 

- TT -^G TT ZG^ aZG,.b (a) ^G^' (j)) _ „-I3Fg 

Here (a) foUows form the fact that 

n n ^G ^. =n^G "^G.^ 

aGVie£;(a) ieE 

and (6) foUows form the fact that in a tree \E\ = \V\ — 1. 

To complete the proof of Claim ITT^ for the case that G is a tree we need to show the following 



(i) The variables /i*^" defined in (|2.1|) satisfy equation (|1.2|l . 
(m) The equations H1.2() have a unique fixed point. 



Remark 2.3. The fact that the equations H1.2|) have a unique fixed point is a weU-known fact for the case 
when G is a tree. For general graphs G, the convergence and the uniqueness is not known. One of the main 
technical ingredients in this paper is to establish the convergence and the uniqueness for general graphs as 
well. 

Lemma 2.4. The variables defined in H2.1|l satisfy equation H1.2|l . 

Proof. We need to show that 

1 



log 



E 

j£E{b)\i 



This is equivalent to showing that 



E 



jeE{b) 

Now using the equations H2.3|l . (|2.1|l we see that this reduces to showing 

Zg 



Observe that this follows from the following: Zg = ZQi,bZQi,a + Z^i.bZ^i.a and Zg^ = Z^i.aZ^ 



□ 



3. Convergence of the iterative equations 

Consider any simple graph G and consider the iterative equations defined on it according to The 
proof for the convergence of the iterative equations is based on the following lemma: 



Lemma 3.1. Let f : HV" — > H be a real valued function defined as follows: 



fix) 



ri°gH + E 



then for any x,y(lz 



\fix)-.f{y)\ < 



k=l 



r + e-'^P 



\x- y\\ 



(3.1) 



(3.2) 



Proof. Since /(x) is differentiable, multi-variable version of the mean value theorem implies that for any 
X, y G R'''* there exist a point z on the line-segment connecting x and y in R''" such that: 

/(x) - /(y) = V/(z) • (x - y) 

From Holder's inequality it follows that 

|/(x)-/(y)|<||V/(z)||i||x-y|U 

In order to show ()3.2|l it suffices to show that ||V/(z)||i < . Note that, 



|V/(z)|li==^ 



— I 

dzM 



E 



E 



For simplicity of notation let Ak = — e^^IF' ^^^^^ '^^^ obtains 



Now using < Ak < e^: 

This completes the proof of Lemma 13.11 □ 

Next we will use Lemma l3. II to prove convergence of the cavity equations for any graph G. 

Theorem 3.2. For any graph G the set of cavity equations 1^1.1}) converges to a unique fixed point independent 
of its initial conditions. 

Proof. Consider an arbitrary initial condition {/i*^"(0)}. The iterative equations in states that 



\t + l) = -j^log 



E 

jeE{b)\i 

Define -F'(x) to be the multi- valued function from R^l^l to R2|^I such that 

{h'^-'[t + l)} = F{{h'^-'{i)}). 

Consider the two-iterate of function F, i.e. let F^ = F o F. Let F^ ~ (/i(x), . . . , /2|_b|(x)) where each /; 
is a real valued function on R^'-^L 

Observe that each function /^(x) can be written in the form (|3.1() where r, s < maxagy(deg(a)). Let 
A = maxagy (deg(a)), the maximum degree of a vertex in G. Now using Lemma l3. II for any t > 2, we have: 

m-^{t + 2)} - {/i''^"(t)}||oo - \\F\W-^}{t)) F\{h^-%t - 2)))\\^ 

= max - fk{{9'-'^{t - 2)}|) < _ _ 2)}|U ,^ ^, 

i<fc<2|B| e + A (3.3) 

A 



Thus the sequence {h^^°'{t)} is Cauchy and hence converges to a point {/i*^"} G R^'^L This shows that 
the equations converge for any graph G. 

To show uniqueness consider two different initial conditions {/i'^''(0)} and {g*^"(0)}. Using the same 
argument as in equation 13. 3() one has: 

A V'\.. ,. . . , ... (3-4) 



- y-^^^) iii^'^''(o)}-{5'""^(o)} 

so both sequences {/i*~*°(t)} and {g*~*''(t)} converge to the same point and this contradicts the existence of 
multiple fixed points. 

This completes the proof of the Theorem 13.21 and shows that the equations converge to a unique 

fixed point for any graph. □ 

In the next section we show that validity of the cavity-claims when the graph G has a large girth. 

4. Validity of the Cavity-Claims for graphs with large girth 

Theorem 13.21 proves the validity of the Claim 11.21 for arbitrary graphs and in particular for graphs with 
large girth. Therefore it suffices to show the validity of Claim [T31 for graphs with large girth. Before starting 
the proof, we note the following bounds on the values of the fixed points of the equation p. 1(1 . The proof of 
this lemma is straightforward and is omitted. 

Lemma 4.1. Let he the the unique fixed points of the iterative equations on any graph G with 

maximum degree A. Then 

log [e-f^ + (A - l)e''] < h'^'' < 1. 



Consider a fixed graph G of size n. Let denote the maximum distance such that the subgraph, G(a; r^), 
formed using the vertices that are within a distance from the vertex a is a tree. Let the vertices in G(a; To) 
be denoted by V{a;ra)- Consider the set of edges, C, that connect V{a]ra) and V{a]raY- Further let the 
subgraph formed by the vertices V{a]raY be denoted as H{a;ra)- 

This decomposes the original graph G into three parts: the subgraph G{a; Tq), the set of edges C, and the 
subgraph H{a; Tq). Pick any subset of the edges D C G. Let J^d C A4{G) denote the set of matchings in G 
that use precisely the subset of edges D in G. Let Vc C V{a; Tq) denote the set of vertices in G{a; rg) that 
are the endpoints of the edges G. Let Vd C Vc denote the set of vertices in G{a; Va) that are the endpoints 
of the edges D. Further, let Ud denote the set of vertices in H{a; Va) that are the endpoints of the edges D. 

Denote G{a; rj, — 1) as the sub-graph formed using the vertices that are within distance Ta — 1 from vertex 
a. Observe that 

G{a;ra) D GvD{a;ra) D G{a;ra - 1) 

Lemma 4.2. Let Di and D2 be two different subsets of G . Let {h^~*^}, {g-''^^} be the unique fixed points of 
the iterative equations on the graphs Gvo^ Gvo^ ■ Then 

/ A x(r„-l)/2 



Proof. Since Gy^^ (a; Tq), Gy^a ('^5 ''a) ^ G{a; ra — 1) we can set {h^^'^iO)} = {^■'^''(O)} for the messages in 
the sub-graph G(a; — 2). 

To bound the difference in the messages at the boundary depending on choice of D, observe the following: 
Let V be any vertex at distance — 1 from a and let u be any neighbor of v that is at distance — 2. Let 
< 5i, < A denote the degree of the vertex v in the graphs Gvo^ i^-j ^a), Gvb^ (a; r^) respectively Let e 
denote the edge joining v to u. Then from Lemma l4.1l we see that 

|^,_„_^e-,„| < l^^g [1 + (A - 1)62/5] . 

We can use Lemma 13.11 to determine the propagation of this difference to the messages at a. It is not 
difficult to see from repeated use of Lemma l3. II that 

/ A s(r.-l)/2 

□ 

For simplicity of notation let us denote 



i.= ilog[l + (A-l)e2^], K 



It is easy to see that v <i for large enough 13. 

Let i be an edge that is connected to the vertex a. We will show that the free-energy shift AF^ for the 
graph G can be approximated by the free energy shift corresponding to the tree G(a,ra). 

Lemma 4.3. Let 6 = i^X (''"-i^/^. Then, 

.ff, Z{G{a;ra)) ^ ^ ^(G(a;r„)) 

Z(G,(a;ra)) - Z(G.) - Z(G,{a-ra)) 

Proof. Observe that 

Z{G) = ^ Z{Gvo (a; ra))Z{Hu^ (a; r,)) 

D 

Similarly for Gi, obtained by removing edge i in G, we obtain 

Z(GO = Z{GvoAa\ ra))Z{Hu^ (a; r^)) 

D 



Let /i0~^° be the converged values of the iterative equations for the case when D = 0. Combining the 
result for trees and our bounds on the converged values of h^^" for different initial conditions in Lemma 14.21 
we know that 

Therefore for all choices of D the following holds 

Z{G{a;ra)) ^ Z{Gv„ia;ra)) ^ Z{G{a;ra)) ^^^^ 

Z{Gi{a;ra)) ' ~ Z{Gvo.tia;ra)) ~ Z{Gi{a;ra)) 
Using this result and the decompositions of Z{G) and Z{Gi) presented above we obtain that 

Z(G(a;r.)) < Z{G)_ ^ Z(G(a;rJ) 

Z(G.(a;ra)) - Z{G^) - Z{G,{a;ra)) 

□ 

Observing that the boundary conditions do not influence the value of the fixed points of the iterative 

equations for the graph G, we can infer that 

\ / Z{Gi) V / 

In a very similar manner to the removal of an edge, we can also show that for some 5 < log(A) + S 

Z{G{a;ra)) ^ Z{G) ^ Z(G(a;rJ) 

Z{Ga{a;ra))' - Z{Ga) ' ZiGa{a;ra)) ' ^'^ 
Similar to the case of the removal of the edge, the above equation implies that 



i£E{a) I \ a) \ i(zE{a) 



4.1. Proof of Claim [TT3I for graphs v^fith large girth. 

Let G be a graph that satisfies the girth condition. 



girth(G) > 8(l"g" + 1"S^ + 21"g"^ + log A + log T5i^) 

log-^ 



where m = |y| + \E\ 



Theorem 4.4. For any e,/3 > 0. Let G be a graph satisfying the girth condition. Then the free energy shifts 
approximates Zq within factor 1 + e, i.e. 



Zcil + e)-^ <XG^^=^^<ZG{l + e). (4.3) 



ri ^ 

n ^ 



Note that when E = then Xq = 1 and so (|4.3() holds. Let E = {ii, i2, . ■ . , i|£;|} be an ordering of edges 
and g ~ [girth(G)/4 — Ij. The following lemma is crucial to prove Theorem 14.41 

Lemma 4.5. For alll < r < \E\: 



where E^ = {ii, . . . , ir} , K = ^^-'iii and m ^ n + \E\. 

Before proving Lemma 14.51 we will show how it can be used to prove Theorem 14.41 



Proof of Theorem \4.4\ Assuming Lemma |4 . 51 and taking the telescopic product of r = 1 to \E\, we obtain 

r=l ^Ge^_i ^Ge^_i ^Ge^_i 

and hence 

The assumptions on the girth of the graph in (|4.2|) implies that e'^'^'^™ -^"^ < 1 + e and this completes the 

proof. □ 

Therefore, all that remains is to prove Lemma 14.51 

Proof of Lemma \4.5\ We need to show that 

Zge,. uA0m.K'J^^ <- ^Ge^ ^Ge. uAPniK^i^^ 



^Ge^ , ~ -'^Ge„ , ~ Zge^ , 



Observe that, 



I n ^"-^ \ n 



Xge n 



-1 



•^Gp I Zg 



eE\Er Zg„ , ; Zg 



r-1- 



(4.4) 



Let Or be an endpoint of the edge ir- We will estimate the product 

-1 

n 



n, 



efiVB. Zg„ , \ Zg 



by partitioning the vertices and edges into two groups; those that are in G{ar,g) and those that are outside 
G{ar,g). 

Using equation (|4.1() whenever a ^ Ge^^i [ar] g), we have 

2'Ge,,_i \^GE^_i,„y 
Similarly whenever i ^ GE^_i{o.r]g) then from Lemma 14.31 we have 

g-^A/SK'/^ < ^Ge,- / ^Ge,-,. \ ^ ^yAm<"\ (4 6) 

^Ge,_i V^Ge_i,./ 

Now wc consider the case when a, i G G{ar, g). 

Since Gs^^iiar', g) C G'b^_j (a,; 2(7) and both graphs are trees, we can use H4.H) for a G GEr._i{ar', g) to 
obtain 

g-t/A/a^g/^ ^Ge^ ^ ^GE^(a.;g) ^ ^Ge^ ^t/A;3i<-«/" 

^GBr:a ~ ^GEra(a,-:g) " ^GEr,a 

7 7 7 (4-7) 

^-iyA/3g«/^ ^Ge,_i ^ ^GE,_i(ar;3) ^ ^Ge,_i ^^AfiK"''^ 

^GE,_i,a ~ ^GE,_i,a(a.;9) " ^GE,„i,a 

Similarly, for i e GE^_iicir', g) using Lemma 14.31 we have 

g-ivAffJ^g/" ^Ge^ ^ ^GE^(a.,g) ^ ^Ge^ ^uAUK"''' 

^Ge,,, ~ ^Ge,,,(o,,3) ~ ^GE,,i 
7 7 7 (4.8) 

^-vAfjRo/'^ ^Ge,_i ^ ^GE,_i(a.,g) ^Ge,_i ^t^AffJfg/" 

^Ge,_i,, ~ ^GB,_i.i(a,,g) ~ ^GE,_i.i 



Note that, 



(4.9) 



Using equations (|4.5|) . (|4.6|) . in the first product and equations (|4.7|l . H4.8|l in the second product (and 
from the definition of Xq), we obtain 

-'^GE^_i(a,;g) ~ ^Ge^_i ~ Xq E^_^{a^;g) 

Lemma |2 . 21 implies that Xq ~ Zq when G is a tree. Therefore, since GE,X'^r] g)i GEr-i{o-r'i 9) a-re trees 
we can replace '^Er'°'--g) TffW\^ ^^j-^j subsequently using Lemma ESI to replace ^e^ -^w^^ 

7 — we complete the proof of Lemma [4. 51 □ 

^GEj J 

5. CONCLUSIONS AND FUTURE WORKS 

In this paper we show the validity of the cavity method for the problem of counting the number of 
matchings for graphs with large girth. The girth condition we have in this paper is quite restrictive and 
several graphs of practical relevance do not meet this condition. However we hope that the methods presented 
here can be extended in a straightforward manner to random regular graphs and Erdos-Renyi graphs. This 
would lead to, as observed in tighter estimates for counting the number of matchings in such graphs. 

We also demonstrate the convergence and uniqueness of the iterative equations for arbitrary graphs. The 
techniques used in this paper do not heavily depend on the nature of the problem and therefore there is a 
good possibility of these techniques having a wider interest and applicability to other important problems 
were cavity method has been applied. 
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